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hese days, Intel 
^"^■^^J^l^is mainly known as 

^"^■^"r the company that, in 
concert with Microsoft, 
controls what goes on inside most 
PCs. Their commanding position has 
served them well, propelling them to 
the top of the 1C business. 

After a decade of PC and x&6 
hoopla, it's easy to forget Intel's earlier 
contributions at the dawn of the 
silicon revolution. Long before the 
x86, Intel invented a chip that is 
arguably even more important— the 
DRAM. 

Looking back at chips like the 
early '70s' Intel 1 103 (see Figure la} is 
always good for a laugh. Imagine what 
your typical 1-MB PC would be like if 
it had 8000 of these little puppies. 
Forget the mini- and maxi-tower 
enclosures; the Trump tower is more 
like it. You'd better beef up your 
power supply too since each 1 103 
burned a half watt or so. Roll-your- 
own precharge, inverted DOUT, and 
16-V I/O levels just add to the fun. 

Nevertheless, as feeble as it now 
seems, the 1 103 started the DRAM 



ball rolling down a slippery slope of 
ever-increasing density and |at least 
until recently) ever-falling prices. 
Now, with the DRAM market moving 
through 4 and 16 Mb to beyond, a few 
chips are enough to handle the most 
bloated software upgrades without 
blinking a bit. 

The modem DRAM era began in 
the late 70s with a 16-Kb DRAM such 
as the Intel 2116 illustrated in Figure 
lb. More important than the technical 
features, acceptance of a standard 
kicked off multisourced DRAM as a 
commodity. The subsequent brutal 
battles for sockets had great benefits 
for users, but drove many suppliers 
(including Intel) from the business. 

DRAM 101 

Though the 2116 came (and went) 
before the x86, most of its features 
persist to this day. Thankfully, a single 
supply replaces the three (+5, +12, and 
-5 V) of yesteryear, but otherwise 
DRAMs haven't changed that much. 

A typical 2»-bit DRAM is orga- 
nized as 2"' ! rows and 2° H columns. As 
shown in Figure 2a, the row address is 
strobed into the DRAM with RAS* 
(row-address strobe). Next, the column 
address is strobed with CAS* (column- 
address strobe), which also clocks the 
data in or out depending on the WE" 
pin. The timing parameter t^ denotes 
the access time of the DRAM (i.e., 
is70nsfora"70-nsDRAM"). 

To refresh your memory (ho-hol, 
DRAMs store each bit on a capacitor 
that leaks. Thus, they must be periodi- 
cally refreshed, usually a row or two 
(limited by power considerations) at a 
time. Typical specs call for the entire 
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The modern DRAM era started win 1S-KI> 



Tom 
reminds 
us that in 
the begin- 
ning, 

DRAM was created. And, 
after many days, Hitachi 
brought forth synchro- 
nous DRAM, which offers 
better timing, greater 
speed, and more 
bandwidth. Tom says that 
this is good. 
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Figure 2a— Basic DRAM read and 
wnte cycles use a RAS'-before-CAS' 
technique. The t w spec determines 
the access speed ot the DRAM. 



Read cycle 



DRAM to be refreshed 
hundreds of times per 
second, though those 
pesky capacitors often 
prove surprisingly more 
bitworthy than that. 

On average, a refresh 
cycle needs to occur every 
15 us or so (i.e., 1-Mb 
DRAM needs 512 refresh 
cycles every 8 ms, a 4-Mb 
DRAM requires 1024 
cycles every 16 ma, etc.). 
But, there's nothing that 
says you can't group all 
your refresh cycles in a 
burst. 

Figure 2b shows the 
popular types of refresh 
cycles. The first is simply 
to read a location in each row, a 
scheme well- suited to graphics 
applications or software-refresh 
schemes. The similar RAS '-only 
refresh cycle was the most widely used 
on the 16-, 64-, and 256-Kb DRAMs. 
Starting with 1-Mb DRAMs, CAS*- 
before-RAS* (also know as CBR) 
refresh was offered and has since 
become popular. With an integrated 
refresh-address counter, CBR doesn't 
require an external address. 

About the same time as the 1-Mb 
DRAMs came into existence, the 
formerly simple-minded DRAMs 
started to get brainy with the addition 
of high-speed access modes (shown in 
Figure 2c). All modes rely on the fact 
that once a row is accessed, it's 
possible to access different columns 
quickly, without requiring a new 
RAS" cycle. 

Page mode repeats only the CAS' 
cycle for each new column address, 
utting access and cycle time nearly in 
alf. Note that the column address 
need not be sequential. Nybble mode 
latches four columns for subsequent 
access by CAS*. Since access is always 
sequential, no column address is 
required, making nybble mode even 
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faster than page mode. Static-column 
mode, as the name implies, offers 
speedy SRAM-like access to the 
column depending solely on the 
address, not CAS*. 

Though a little slower than the 
others, page mode 
has proven the 
market favorite. 
Recently, a variant 
of page mode called 
Extended Data Out 
(EDO) has been 
introduced, which 
quickens page-mode 
cycles by leaving the 
output driver on 
between CAS* 
pulses. It looks like 
EDO is going to be 
widely adopted, 
further cementing 
page mode's position 
as the high-speed 
access mode of 
choice. 

Subsequently, 
DRAM technology 
has migrated into a 
variety of specialized 
ICs including video 



RAM, TV line memory, pseudo-SRAM 
(PSRAM), and so on. Nevertheless, the 
plain old, xl or x4 page -mode DRAM 
reigns supreme in the marketplace. 

Now, fueled by the insatiable 
demand for more bandwidth (for less S, 
of course), a whole raft of clever new 
DRAMs are crawling out of the lab. 
One particular model appears headed 
for success — thanks not only to good 
features, but also to the warm, fuzzy 
glow multisource standardization 
imparts. Say "Hello" to the new kid 
on the block, the synchronous DRAM, 
otherwise known as SDRAM. 

ROCK 'EM, CLOCK 'EM 

JEDEC (Joint Electron Device 
Engineering Council), a formerly mild- 
mannered international standards 
committee, has emerged as a powerful 
force in the memory market. 

Originally, they, like many 
standards organizations, reacted to 
market realities by documenting after- 
the-fact standards. However, as the 
demand for standardization has grown, 
their mediation and blessing has 
become very important. JEDEC played 
a major role in making commodity 
SDRAMs a reality by forging a peace 
agreement between headstrong 
contenders. 



RAS-only 
refresh 




Figure 2b- The Read and RAS'-only refresh methods, wnich require an address, 
are being supplanted by CAS' -before-RAS', which doesn I 
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Figure 2c— 01 the three high-speed 
access modes. Fast-Page mode is most 
widely used. 



Let's start by taking a 
look at one of the first 
SDRAMs out of the blocks, 
the 4-Mb, 66-MHz Hitachi 
HM5241605 (see Figure 3). 
Thanks to IEDEC standard- 
ization, most of the discus- 
sion applies to SDRAMs 
from other suppliers, though 
you've always got to be on 
the lookout for minor spec 
differences. 

The first thing you'll 
notice is it's wide (xl6), 
thankfully easing the 
frustration caused by the 
narrow xl and x4 DRAMs. So far, it 
looks like SDRAMs |4 Mb and 16 Mb| 
will initially be offered in x4, x8, x9, 
xl6, and xl8 configurations. Later, I 
expect we'll see (64 Mbl x32. 

SDRAMs reflect the modem 
downsizing trend in their use of TSOP 
(Thin Small Outline Package) surface- 
mount technology. Thanks to TSOP, 
the miniaturization possibilities are 
truly impressive— 1 /: MB (2 MB for 16- 
Mb SDRAM) fits in a 
footprint about the 
same as the old DIP 
(=•10x20 mm) and iust a 
little thicker (1.2 mm| 
than a business card. 

The easiest way to 
move bits faster with 
less power is to move 
them less far, so 
SDRAMs downsize the 
power supply to 3.3 V 
±10%. Now here's a 
place you need to check 
an individual SDRAM 
data sheet. It looks like 
some SDRAMs may 
achieve 5-V TTL 
compatibility by 
tweaking their AC 
specs. However, this 
capability may disap- 
pear in future-genera- 
tion SDRAMs with 





finer geometries. In any case, for the 
same speed and power reasons, 
whatever the SDRAM connects to is 
increasingly likely to be 3.3 V too. 

Otherwise, the pin names cer- 
tainly look familiar. There's RAS', 
CAS', WE', CE', and multiplexed 
addresses just like before. The data 
lines (I/O0-I/O15) are bidirectional, 
but the separate DIN and DOUT lines 
of the plain DRAMs were usually 
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Figure 3- r7ie Hitachi HM524 1605 synchronous DRAM might be considered a New Age DRAM. 



connected together on the PCB 
anyway. The SDRAM features separatt 
power for the memory array (V cc and 
V M ) and I/O pins |V CC Q and V K Q) to 
help isolate noise. 

DQMU and DQML are individual 
byte enables for the data bus, neces- 
sary to support an atomic, byte-write 
operation. Otherwise, a byte write 
would require a word read, byte mask, 
word write sequence. 

It's the clock pin 
(CLK) and its enable 
(CKE) that are most 
important. After all, 
that's what the S in 
SDRAM is all about. 
While a plain DRAM is 
kind of passive, sitting 
around waiting for 
someone to tickle RAS ' 
and CAS', an SDRAM 
is a rather nervous 
critter that wants to 
blast some data on 
every clock edge (see 
Figure 4|. With clock 
rates up to 66 MHz 
available now and with 
100 MHz iust around 
the comer, an SDRAM 
can handle the data 
rates demanded by 
superduper CPUs and 
see-whiz graphics. 
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Figure ^-Synchronous DRAMs rely on a high-speed clock te.g.,XortOO MHz) to time all commands and data 



ELECTRONIC BANKING 

Looking inside the SDRAM 
| Figure 5) reveals differences hidden 
behind the familiar-sounding pin 
names. The most obvious difference is 
.hat the memory array is split into two 
lanks (selected using A9). As we'll see 
later, this supports a ping-pong, bank- 
switching scheme which maximizes 
bandwidth. 

The other thing to note is that the 
familiar-sounding pin names — RAS*, 
CAS', WE', and so on— no longer 
actually connect to latches and buffers, 
but instead feed a control-logic block. 

It turns out these pins don't 
actually control the memory transfer 
(remember, CLK does that|, but should 
be interpreted as bits in an opcode as 
shown in Figure 6. The pins are 
sampled at each CLK cycle and inject a 
command into the SDRAM pipeline 
unless in a DESL (CS' high] or NOP 
IRAS', CAS', WE' high) state. The 
first command to get familiar with is 
MRS (Mode Register Set) since it 
defines the key SDRAM operating 
characteristics (see Figure 7). 



Burst length refers to how many 
data transfers and clocks are associated 
with each REAO or WRIT command, the 
choices being 1, 2, 4, 8, and full page 
(which, in this case, is 256). Shorter 
bursts end automatically, while the 
BST command terminates full-page 



bursts. Burst type refers to the address- 
ing order of the burst and is normally 
sequential, unless you're connected to 
a '486 or Pentium, which feature a 
unique(! | interleaved, cache line fill. 

Write mode offers the option of 
burst read and write or burst read with 
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Figure 5— inside, synchronous DRAMs are split into two banks tor maximum performance. Many of the control ptn< 
have familiar names, but now, they are little more than the opcodes for an on-chip logic that controls the actual 
memory access. 



single write. The latter mode may be 
useful in graphics applications (e.g., a 
burst read refreshing the CRT and 
single writes coming from the drawing 
processor) or to support a CPU with 
write -through cache. 

CAS* latency depends on how 
many clock cycles elapse between a 
request and transfer, with higher clock 



rates demanding more delay. For 
instance, a particular-speed SDRAM 
may achieve 15-ns clock cycle and 
access times with CAS* latency 2, but 
only 30 ns with CAS* latency 1. Figur 
8 summarizes the CAS* latency and 
burst-length options. 

As shown, a transfer is initiated 
with an ACTV (activate) command, 
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Figure 7— The MRS (Mode Register Set) command defines synchronous DRAM basic operating characteristics 
such as burst length and CAS' latency. 



which is pretty much like the RAS * 
cycle of old. In this case, it selects the 
bank (A9) and row (A0-A8|. This is 
followed, after the requisite delay 
,t RCD ), by a READoi WRIT command 
that specifies the column address (A0- 
A7) as in yesteryear's CAS*. After the 
previously specified CAS* latency, the 
data is blasted on each clock edge. 

The READ A, WRIT A, PRE, and 
PALL commands all have to do with 
precharge. Since precharge has been 
automatic since the days of the 21 16, 
:c might seem like a step backward to 
make the system designer handle it. 

Fortunately, if you'd rather not be 
bothered, the A versions of the READ 
and WRIT commands feature auto- 
matic precharge. However, as in plain 
DRAM, automatic precharge lengthens 
the basic cycle time. Fortunately, oven 
control of precharge in conjunction 
■vith the bank scheme does offer the 
opportunity for aggressive designers to 
hide precharge in one bank while 
transferring data from another. 

That about covers the SDRAM 
instruction set, except tor our old 
friend retresh, of which two variants 
are offered, depending on the state of 
the CKE pin. If CKE is high (i.e., the 
dock is enabled), an AUTO REFRESH 
mimics a plain DRAM's CBR-refresh 
cycle (i.e., an on-chip refresh counter 
means no external address is required!. 

SELF /?ff/?f5W|CKElow)iskind 
of a power-down mode, in which the 



DRAM keeps itself refreshed while 
consuming amazingly little power 
(only 2 mA compared to its 50-100- 
mA active consumption). It's tempting 
to just leave the SDRAM in SELF 
REFRESH between accesses. Unfortu- 
nately, however, exit from SELF 



REFRESH requires the issuance of 1024 
cycles of AUTO REFRESH to update the 
refresh counter, so SELF REFRESH is 
only suitable for serious napping. 

Figure 9, showing the allowed 
instruction sequences, looks compli- 
cated, but really isn't if you talk it 
through. When idle, the SDRAM 
should be refreshed 1024 cycles every 
16 ms. An ACTV command (think of it 
as a RAS* cycle) selects a bank and 
row for action. Once activated, the row 
can be accessed freely, bursting data 
both ways from random column 
addresses. After all transfers are 
completed (or if the row address 
changes), exit is via a precharge cycle, 
taking the SDRAM back to the idle 
and refresh state. 

Remember, the key point of the 
two-bank scheme is that each bank 
can be in a different state and the state 
transition intervals are faster between, 
rather than within, banks. For in- 
stance, the minimum ACTV-to-ACTV 
(i.e., cycle time) spec within a bank is , 
1 10 ns while between banks it's only 
30 ns. 
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figure 8 — T7)oug/r not shorn on the figure, remember that higher CAS' latency otters a taster dock rate. Snort-burst transfers automatically terminate, but full-page bursts mui 
be explicitly terminated att the BST command. 
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« your Big-Company 
come up with mote ideas than me ejMfeenng 

department can cope with? Are you a small com- 
pany that can't afford a full-time engineering staff 
for once-m-a-whjja, designs? 

Steve Clartia and the ClaSfiTDesfgrHlVorks 

staff may have the solution. WaJpHHEam of 
accomplishes programmers and engine-ready to 
design products or solve tnctcy engineering prob- 
lems. Whether you need an oq-Umj solution for a 
unique prabfemfb product forVstartup venture, 
or |ust experienced consuitin#fSf&faiOa.Design 
"'orks is ready to work withy 
'"■ .in touch. 
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❖ Connect via ROM Socket; DIP; PLCC; SMT 
•> Emulate ROMs up to 16MBit in Size 
•> Fastest Downloads Available: 
Parallel; Serial; Ethernet 
Run Industry Standard Debuggers 
Target Processor Independent 
Support 3Volt Targets 
Host Software Sources Included 
Shielded Cables for Reliable Operation 
30-day Money-Back Guarantee; 
1Year Warranty! 
•> Unlimited Phone Support; 24hr BBS 

Call Today 1-800-PROMICE 

(1-800-776-6423) 

Grammar Engine Inc. 

92 1 Easrwind Dr., Suite 1 22 • We-terville. OH 4308 1 
614/899-7878 • Fax 614/899-7888 
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Figure 9-*t synchronous DRAM is either idle or actively reading and nriting or precbargmg. 



BURST OR BUST 

You might think all this 66- and 
100-MHz talk implies some kind of 
10-times breakthrough in DRAM cell 
-peed. In fact, the speed of basic 
memory operation really isn't much 
different (witness the 1 10-ns cycle 
time mentioned above is about the 
same as plain DRAMsl. The SDRAM 
simply blazes a faster trail to what's 
otherwise regular DRAM. 

Thanks to the ping-pong feature, 
even the simplest SDRAM setup 
CAS - latency - 1, burst length - 1) 
tiers good potential. Note that the 
parallelism can be extended across 
multiple SDRAMs as well (i.e., you 
can also ping pong between chipsl. 



However, there's no doubt that 
bursty applications such as high-end, 
32-bit CPUs and graphics best match 
the features of SDRAM. The tradeoff 
between CAS* latency, clock rate, 
burst length, number of chips, and so 
on is hard to generalize. 

Graphics applications will likely 
shoot for maximum bandwidth le.g., 
CAS* length - 3, burst length - full 
page, CLK - 66 MHzl. It then becomes 
simply a matter of choosing the 
number of SDRAMs to support a given 
display. For example, a 1024 x 768 x 24 
x 72-Hz display calls for 170 MB per 
second, while each 66-MHz SDRAM 
offers 132 MB per second. Thus, a two- 
SDRAM frame buffer easily handles 




display refresh, leavT 
bandwidth for drawing 

CPU main-memory i 
are a little more tricky. A sin 
controller may just set the b 
equal to the cache-block-r< 
leave it at that. CAS ' latency versus"^ 
CLK rate largely depends on memory- 
controller speed, cost and complexity 
issues, and "hit ratio" tradeoffs (i.e., 
weighing greater latency penalty 
against higher burst bandwidth). 

More aggressive controller designs 
are possible if you're willing to sift 
through the entrails of your program 
and truly understand its behavior. It 
might be wise to put instructions and 
data in separate SDRAMs to take 
advantage of locality. Often, instruc- 
tion- and data-cache-refill sizes are 
different, so each SDRAM's burst 
length could be set appropriately. 

Another idea is to control burst 
length dynamically, either by overtly 
terminating full-page bursts or by 
reprogramming the mode register. This 
opens the door to really clever dy- 
namic or "greedy" burst schemes, 
which wring all possible bandwidth 
out of the SDRAM. 

No doubt, the prospect of rolling 
your own SDRAM controller, espe- 
cially a fancy one, is rather daunting. 
The good news is I expect SDRAM 
support will quickly migrate into 
likely partners including 32-bit RISC 
CPUs, graphics ICs, and PC chipsets. (J) 

Tom Camrell has been an engineer in 
Silicon Valley for more than ten years 
working on chip, board, and systems 
design and marketing. He can be 
reached at (510) 65 7-0264 or by fax at 
(510) 657-5442. 
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Hitachi America, Ltd. 
Semiconductor and IC Division 
2000 Sierra Point Parkway 
Brisbane, CA 94005-1819 
|415) 589-8300 
Fax: (4151 583-4207 
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